Discriminative Reordering Model for Machine Translation

Author

  • Yang Gao
Abstract

We have built a discriminative reordering model for Moses, the phrase-based machine translation system developed at the University of Edinburgh. The model is a maximum entropy classifier that incorporates a variety of feature functions to predict phrase orientation for machine translation. Two kinds of features reported in the literature, namely lexical features and the dependency path feature, have been tested in the discriminative model. We have also proposed and tested a novel feature named dependency orientation, and modified the dependency path feature with lexicalization. Two baseline models are used in the evaluation: a distance-based model without lexicalized reordering, and the lexicalized reordering model. We achieve significant BLEU gains of up to 0.95 absolute points over the distance-based model, and of up to 0.74 absolute points over the lexicalized reordering model. The discriminative reordering model is a very generic framework and is open to many more features for further improvement.
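To make the idea of a maximum entropy orientation classifier concrete, here is a minimal sketch in Python. It is not the author's implementation and does not use any Moses API. The three orientation labels, the boundary-word features in `extract_features`, the toy phrase pairs, and the training settings are all assumptions chosen for illustration only.

```python
import math
from collections import defaultdict

# Assumed label set; the abstract does not list the orientation classes.
ORIENTATIONS = ("monotone", "swap", "discontinuous")


def extract_features(source_phrase, target_phrase):
    """Hypothetical lexical features: boundary words of a phrase pair."""
    src = source_phrase.split()
    tgt = target_phrase.split()
    return [
        "src_first=" + src[0],
        "src_last=" + src[-1],
        "tgt_first=" + tgt[0],
        "tgt_last=" + tgt[-1],
    ]


def predict_proba(weights, features):
    """Maximum entropy form: p(o | x) = exp(sum of active weights for o) / Z."""
    scores = {o: sum(weights[(o, f)] for f in features) for o in ORIENTATIONS}
    max_score = max(scores.values())  # subtract the max for numerical stability
    exps = {o: math.exp(s - max_score) for o, s in scores.items()}
    z = sum(exps.values())
    return {o: e / z for o, e in exps.items()}


def train(examples, epochs=200, learning_rate=0.5):
    """Stochastic gradient ascent on the conditional log-likelihood."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for features, gold in examples:
            probs = predict_proba(weights, features)
            for o in ORIENTATIONS:
                # Gradient for a binary feature: observed count minus expected count.
                gradient = (1.0 if o == gold else 0.0) - probs[o]
                for f in features:
                    weights[(o, f)] += learning_rate * gradient
    return weights


# Toy phrase pairs with invented orientation labels, for illustration only.
TOY_EXAMPLES = [
    (extract_features("das haus", "the house"), "monotone"),
    (extract_features("gegessen hat", "has eaten"), "swap"),
    (extract_features("nicht gesehen", "did not see"), "discontinuous"),
]

if __name__ == "__main__":
    model = train(TOY_EXAMPLES)
    print(predict_proba(model, extract_features("das haus", "the house")))
```

Further feature types, such as the dependency-based features mentioned in the abstract, would enter this sketch as extra strings returned by `extract_features`, which is one way to read the claim that the framework is open to many more features.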

Similar resources

Discriminative Reordering Extensions for Hierarchical Phrase-Based Machine Translation

In this paper, we propose novel extensions of hierarchical phrase-based systems with a discriminative lexicalized reordering model. We compare different feature sets for the discriminative reordering model and investigate combinations with three types of non-lexicalized reordering rules which are added to the hierarchical grammar in order to allow for more reordering flexibility during decoding...

Discriminative Reordering Model Adaptation via Structural Learning

Reordering model adaptation remains a big challenge in statistical machine translation because reordering patterns of translation units often vary dramatically from one domain to another. In this paper, we propose a novel adaptive discriminative reordering model (DRM) based on structural learning, which can capture correspondences among reordering features from two different domains. Exploiting...

Dynamic distortion in a discriminative reordering model for statistical machine translation

Most phrase-based statistical machine translation systems use a so-called distortion limit to keep the size of the search space manageable. In addition, a distance-based distortion penalty is used as a feature to keep the decoder translating monotonically unless there is sufficient support for a jump from other features, particularly the language models. To overcome the issue of setting the op...

Large-scale Reordering Model for Statistical Machine Translation using Dual Multinomial Logistic Regression

Phrase reordering is a challenge for statistical machine translation systems. Posing phrase movements as a prediction problem using contextual features modeled by a maximum entropy-based classifier is superior to the commonly used lexicalized reordering model. However, training this discriminative model on a large-scale parallel corpus might be computationally expensive. In this paper, we explor...

Inducing a Discriminative Parser to Optimize Machine Translation Reordering

This paper proposes a method for learning a discriminative parser for machine translation reordering using only aligned parallel text. This is done by treating the parser’s derivation tree as a latent variable in a model that is trained to maximize reordering accuracy. We demonstrate that efficient large-margin training is possible by showing that two measures of reordering accuracy can be fact...

Publication date: 2010